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Abstract 



We introduce a 2-round stochastic constraint-satisfaction problem, and show that its approx- 
imation version is complete for (the promise version of) the complexity class AM. This gives a 
TCP characterization' of AM analogous to the PCP Theorem for NP. Similar characterizations 
have been given for higher levels of the Polynomial Hierarchy, and for PSPACE; however, we 
suggest that the result for AM might be of particular significance for attempts to derandomize 
' this class. 

, To test this notion, we pose some 'Randomized Optimization Hypotheses' related to our 

stochastic CSPs that (in light of our result) would imply collapse results for AM. Unfortunately, 
the hypotheses appear over-strong, and we present evidence against them. In the process we 
show that, if some language in NP is hard-on-average against circuits of size 2 n ("), then there 
exist hard-on-average optimization problems of a particularly elegant form. 
' All our proofs use a powerful form of PCPs known as Probabilistically Checkable Proofs of 

Proximity, and demonstrate their versatility. We also use known results on randomness-efficient 
soundness- and hardness-amplification. In particular, we make essential use of the Impagliazzo- 
Wigderson generator; our analysis relies on a recent Chernoff-type theorem for expander walks. 



1 Introduction 

1.1 Background: PCPs and complexity classes 

A Constraint Satisfaction Problem (CSP) is a collection ip(x) of Boolean-valued constraints over 
variables on a bounded-size alphabet S. A CSP in which each constraint depends on at most k 
variables is called a k-CSP. A natural computational task is to determine the maximum fraction 
of constraints that can be satisfied by any assignment. Cook's Theorem 
problem is NP-complete, and the landmark PCP Theorem of Arora et al. 



Pap94 states that this 



ALM+98] implies that, 



for a sufficiently small constant e > 0, it is NP-hard even to output an estimate that is within e of 
this maximum fraction (where in both results we may take k = 3, £ = {0, 1}). 

Given the importance of the PCP Theorem for complexity theory, researchers have looked for 
analogues of the result for complexity classes other than NP. The PCP Theorem can be seen as 
stating that it is NP-hard to determine within e the value of a 1-player 'solitaire' game defined by 
a 3-CSP. It is equally possible to study games played on a /c-CSP in which 2 players alternate in 
setting values to designated blocks of variables, with one player trying to maximize the fraction 
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of satisfied clauses and the other trying to minimize this fraction. These games were explored in 
several works. Ko and Lin |KL94] showed that approximating the value of such a game is hard for 
the j-th level of the Polynomial Hierarchy, if the game lasts for j moves. In more recent work of 
Haviv et al. [HRTS07] this result was shown to hold even if each variable is allowed to appear in 
at most a constant number of constraints. 

If the game is allowed to last polynomially many rounds, the approximation problem becomes 
PSPACE-hard as shown by Condon et al. [CFLS95] . The same authors showed the approximation 
problem for poly(n) rounds is also PSPACE-hard if a maximizing player plays against a random 
player [CFLS97] (where the game's value now is the expected number of satisfied clauses under 
optimum play by the maximizer). Moreover, all of the hardness-of-approximation results mentioned 
so far are in fact completeness results for the corresponding promise classes, so they can be viewed 
as giving 'PCP characterizations' of NP, PSPACE, and the Polynomial Hierarchy. 

One class that did not receive a PCP characterization based on CSP games was the Arthur- 
Merlin class AM. In fact, there are few known natural complete problems for AM (technically, 
for its promise version, prAM; we don't know if AM, a semantic class, has any complete prob- 
lems. See Sec. 12.31 for the definition of prAM.). To this author's knowledge there is only one 
approximation problem previously known to be prAM-complete: Mossel and Umans [MUQ2] give a 
prAM-completeness result for approximating the VC dimension of set systems. This striking result 
does not fall within the framework of CSP games given above. 

1.2 Our results 

In this paper we present a PCP characterization of prAM. We consider 'stochastic' 2-CSPs ip(r, z), 
where r is a collection of Boolean variables and z a collection of variables over an alphabet X. Let 
Val^(r, z) be the fraction of constraints of ip satisfied by (r, z). In Section [4] we prove: 

Theorem 1. There is a finite alphabet X and a constant e > 0, such that it is prAM -complete to 
distinguish between the following two sets of 2-CSPs: 

^-yes = {"0 : f or a tt r there exists z such that VaL,(r, z) = 1}; 
ILvo = {ip '■ with probability 1 — exp(— f2(|r|)) over random r, Max^ [VaL/, (r, z)] < 1 — e}. 

In particular, this implies that e/2-approximating the value of the 2-round game associated 
with ^)(r,z) (where the first player plays randomly) is AM-hard. 

AM is a class for which we feel such a PCP characterization might be especially important. 
There is compelling evidence that AM = NP, or at least that significant derandomization of AM 
is possible (see |SU07| for an overview of this line of research). One approach to to try and 
derandomize AM is to directly attack the 'easiest' AM-hard problems, and a problem like the one 
provided by Theorem [T] seems like a plausible candidate. 

How might such an attack proceed? We make a concrete suggestion in the form of two 'Ran- 
domized Oracle Hypotheses'. In what follows ifj(r,z) is a 2-CSP over t Boolean variables (r) and 
m variables (z) over a finite alphabet X. 

Hypothesis A (Randomized Optimization Hypothesis for P/poly). Fix any 5 > 0. For 

every 2-CSP if)(r, z), there exists a circuit C^,(r) : {0, 1} — > S m of size 0(~poly(\il>\)), such that with 
probability at least 1/ poly(^) over a random r 6 {0, 1} £ , we have 

VaJ^(r, C^(r)) > Max z [Val^(r, z)\ - 5. 
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In a nutshell, this hypothesis suggests that it is easy to approximately-optimize over z for a 
random choice of r, if we allow our optimizer to depend nonuniformly on the 2-CSP tp. (Such 
nonuniformity is clearly necessary, in light of the PCP Theorem for NP.) This hypothesis, if true, 
would yield a collapse result for AM. In Section [5] we prove the following claim by a straightforward 
application of Theorem [TJ 

Claim 2. Hypothesis A implies AM = MA. 

A strengthened hypothesis could have the stronger implication that AM = NP. Consider the 
following: 

Hypothesis B (Randomized Optimization Hypothesis for NC°). For any 5 > 0, there is an 
integer t = t(5) > such that the following holds. For every 2-CSP ip(r, z), there exists a function 
F^,(r) : {0, 1} £ S" 1 , where each output coordinate of F^ depends on at most t bits of r, and such 
that with probability at least 1 — 5 over r, 

Va^(r,F^(r)) > Max z [Val^(r, z)} - S. 

In Section [5] we prove: 

Claim 3. Hypothesis B implies AM = NP. 

Given the potential consequences of these hypotheses, what chance do they have of being true? 
Unfortunately, it seems that each is unlikely. In Section [6] we prove two results, each to the effect 
that, if NP decision problems are hard on average for exponential-size circuits, then both hypotheses 
fail in a strong way. We state these results next. A language L is called p(n)-hard for size s(n) if 
for every circuit C of size s(n), Pr xg {o,i} n [C{x] = L(x)\ < p(n). 

Theorem 4. Suppose there exists a 71 > and an L € NP n coNP that is (1 — 1/ poly(n))-hard for 
size 2 7in . Then there exists 0,72,$ > and a polynomial-time constructible family {ip n {f,'w)} n> Q 
of 2-CSPs (with \r\ = cn, \w\ = d(n) = 0(poly(n))j ; such that: 
§4\i) for all r, there exists a w such that Val^ n (r,w) = I; 

for all n, if C : {0, l} cn — ¥ {0, l} d ( n ) is a circuit of size at most 2 72 ™, then 

Pr[Val^ n (r,C(r)) > 1 - 9} < exp{-ft(n)}. 

r 

Theorem 5. There is an eq > such that the following holds. Suppose there exists a 71 > 
and an L G NP that is (1/2 + £o)-hard for size 2 7in . Then there exists a c > 0, a polynomial- 
time constructible family {i^ n ( r ,w)} n> o of 2-CSPs (with \r\ = cn, \w\ = d(n) = 0(poly(n))/, and 
72, > 0, such that: 

^i) With probability > 1 — exp{— ft(n)} over r, there exists w with VaL^ n (r, w) = 1; 
IfC: {0, 1}°^ ->■ {0, 1}^™) is any circuit of size at most 2^ n , then 

Pr[Val^ (r,C(r)) > 1 - 9] < exp{-ft(n)}. 
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We note that the hypothesis in Theorem [5] is implied by the hypothesis that there exists a 
balanced function L € NP that is (1 — 1/ poly(n))-hard for some size s(n) = 2^( n ); this follows from 
a result of O'Donnell (O'D02j (see also Healy et al. |HVV06| . where the needed form of O'Donnell's 
result is made explicit and proved in a stronger form). 

Theorems 0] and [5] both say that if NP (or NP n coNP) has sufficiently hard problems, then 
this hardness can be 'concentrated' into a kind of 'inapproximability-on-average' result for an 
optimization problem associated with a single, uniform family of (stochastic) CSPs. Note that the 
two results offer a tradeoff: Theorem [5] gives a slightly weaker conclusion from a presumably likelier 
hardness assumption. The assumptions in the above results are strong but, we feel, plausible. But 
at the very least, these results suggest that the approach we suggested to showing new upper-bounds 
on the power of AM must be modified to have a reasonable chance of succeeding. 

We feel, however, that the Random Optimization Hypotheses are worthy of study in their own 
right, even if they turn out to be false; we pose some concrete questions about them at the end of the 
paper. We also feel that Theorems 0] and [5] are interesting for the study of average-case hardness in 
NP, and that the CSP families they produce might have further applications in complexity theory. 

1.3 Our methods 

All of our three main results-TheoremsHKH and[5]-are essentially hardness results for computational 
tasks associated with 2-CSPs. In each case the reduction with which we prove our result uses a 
powerful type of PCP known as Probabilistically Checkable Proofs of Proximity (PCPPs). PCPPs 
were introduced independently by Ben-Sasson et al. [BSGH+06| and by Dinur and Reingold |DB,06j . 
and the PCPPs we use were developed by Dinur [ Din07| (in |DR06t IDin07j PCPPs are referred to 
as 'Assignment Testers'). In Section [3J we derive a variant form of PCPPs (Lemma [9]) that is more 
useful for our purposes. 

Lemma [9] gives a general reduction (similar to past uses of PCPPs, e.g., in |Din07| ) in which 
we start with a two-argument circuit Q(r,w) and efficiently produce a 2-CSP ip(r,z). The basic 
hope for our reduction is as follows: first, for any r, if the restricted circuit Q(r, ■) is satisfiable (i.e., 
there exists w such that Q(r, w) = 1), then the restricted 2-CSP ip(r, •) should be satisfiable as well. 
Second, if Q(r, •) is unsatisfiable, then any assignment to ip{r, •) should violate an f2(l)-fraction 
of the constraints in ip. Unfortunately, this second requirement is too strong and cannot be met. 
What we can guarantee is that if r is 'far' in Hamming distance from any r' for which Q(r', •) is 
satisfiable, then for any z, (r,z) violates an f2(l)-fr action of constraints of ip. 

How does this reduction help us prove the prAM-hardness result in Theorem QJ Any instance 
x of a promise problem n = (Uyes^no) defines a predicate Q(r,w) computed by a poly-size 
circuit. If x £ Hyes then for all r, Q(r, ■) is satisfiable; while if x £ n^vo then for a 2/3 fraction 
of r, Q(r, •) is unsatisfiable. In order to apply our reduction, we need a stronger condition in the 
second case: a random choice of r should be far from any r' for which Q(r', •) is satisfiable. In other 
words, we need an extremely low error probability in our underlying Arthur-Merlin protocol. This 
cannot be achieved by straightforward parallel repetition, but it is provided by a theorem of Bellare 
et al. |BGG93j which gives a randomness-efficient soundness-amplification for AM. Interestingly, 
Mossel and Umans [MU02] also used such amplification for their AM-hardness-of-approximation 
result on VC dimension, but for rather different reasons (unrelated to PCPs). 

Our prAM-hardness proof is, we feel, more straightforward than the existing proofs of the 
analogous results for PS PACE and the Polynomial Hierarchy, modulo our use of sophisticated tools 
(PCPPs and efficient soundness-amplification) which we apply in a 'black-box' fashion. Of course 
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we do not rule out that our result could be also proved more directly by adapting ideas from the 
earlier papers (which use some of the same property-testing ideas that have gone into constructions 
of PCPPs). But we feel that PCPPs in particular, which have already found applications in PCP 
construction, coding theory, and property testing (see [BSHLM08J for an overview), are a versatile 
tool which could be of more widespread use in complexity theory. In a very recent example of their 
utility, Williams [Wil] applied PCPPs to the study of circuit lower bounds. 

Next we discuss our methods in Theorems H] and [5j Our transformation in Lemma [9] from the 
circuit Q to the 2-CSP ip has a further useful property: we can reduce the problem of finding 
satisfying assignments to Q(r, •), to the problem of finding nearly-optimal assignments to ip(r,-). 
Roughly speaking, we show the following. Suppose there is an algorithm P(r) producing an assign- 
ment z, such that with some probability p over r, (r, P(r)) satisfies 'almost all' of the constraints 
of ip; then there is a second algorithm P(r) such that Q(r,P(r)) = 1 with probability p' > 2~ £ \ r \p 
(where e > can be chosen arbitrarily small). This property of the reduction is somewhat more 
novel, although the techniques we use (involving error-correcting codes) still follow previous works. 

To apply our reduction, we use the hardness assumptions in Theorems H] and [5] to produce 
predicates Q(r,w) such that Q(r, ■) is satisfiable with high probability, while any 'small' witness- 
producing circuit C fails to solve the search problem associated with Q: that is, Q(r,C(r)) = 
with high probability. Because of the exponential loss factor 2 _£ l r l in our reduction, we need the 
search problem associated with Q to be extremely hard: we need every 'small' circuit C to succeed 
with probability at most exp{— J7(|r|)} over r in achieving Q(r,C(r)) = 1. 

To produce such extremely hard search problems from a more 'mild' hardness assumption, we 
use existing hardness-amplification techniques. In particular, we use the well-known Impagliazzo- 
Wigderson generator jTW97]. This generator, on input parameter n, takes a seed r of length 
0(n), and produces re 'pseudorandom' outputs g%, . . . ,g n each of length re. The generator has the 
property that if language L is mildly hard for sufficiently small (but exponential-size) circuits, then 
any sufficiently smaller circuit has success probability < exp{— Q(n)} in correctly guessing the re- 
bit string (L(gi), . . . , L(g n )). Then, if our hard language L is in NP n coNP (as in Theorem H]), 
defining our predicate Q is straightforward: we let Q(r,w) = 1 iff w contains 'proofs' for the re 
values (L(s-i),...,L(c/ n )). 

If our hard language is merely in NP (as in Theorem [5]), we need to work harder. In this case, 
we let Q(r,w) = 1 iff w contains proofs that L(gi) = 1, for a 'sufficient number' of the strings gi. 
The idea is that if a small circuit C{r) could with some noticeable probability guess such proofs 
for 'almost all' the indices i for which L(gi) = 1, then C could be modified to correctly guess 
(L(gi), . . . , L{g n )) with noticeable probability, contrary to the properties of the generator. Making 
this idea work involves showing that the set size \{i £ [re] : L(gi) = 1}| is highly concentrated around 
its expectation. For this we rely on a recently proved concentration result called the Strong Chernoff 
Bound for Expander Walks |WX05^ IWX08} IHea08] . This result is perfectly suited to analyze the 
Impagliazzo- Wigderson generator (which is partly defined in terms of walks on expander graphs) . 

The precise form of our assumptions in Theorems [4] and [5] are dictated by the hardness- 
amplification tools currently available. In particular, sufficiently strong hardness-amplification is 
only available if we make a hardness assumption against nonuniform, exponential-sized circuits. 
We believe versions of Theorems S] and [5] should be possible for a uniform hardness assumption; re- 
cently Impagliazzo et al. [IJKW08I made partial progress towards the hardness-amplification tools 
needed. 
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2 Preliminaries 

2.1 Basic definitions 

We presume familiarity with basic notions in complexity theory, in particular familiarity with the 
classes P, NP, and AM. We define promise classes and the promise class prAM in Section [2.31 

For a language L C {0, 1}*, we use L(x) to denote the characteristic function of L. We use |x| 
to denote the length of a string x over some (possibly non-Boolean) alphabet S. d(x, y) denotes 
the Hamming distance between strings x, y £ S n , and d(x, S) is the generalized Hamming distance 
between x € H n and a set S C £ n . If d(x, S) < c we say x is c-close to S, otherwise x is c-far from 
S. Similarly, for a E [0, 1], if d(x,S) < an we say x is a-close in relative distance to S, otherwise 
x is a-far in relative distance from S. 

H{t) : [0, 1] — > [0, 1] denotes the binary entropy function, H(t) = —tlogt — (1 — t) log(l — t) for 
t G (0, 1) and H (0) = H(l) = 0. We let V n ^ denote the discrete volume of the Hamming sphere of 
radius k in {0, 1}™; that is, 




and we use the known bound V n , an < 2 H ^ n (valid for a G [0, 1/2]). 

When we speak of circuits, unless otherwise mentioned we mean deterministic Boolean circuits 
of fanin-two, and we measure circuit size (denoted |C| for circuit C) as the number of gates. For 
functions p(n) E [0, 1], s(n) > we say that a language L is p(n)-hard for size s(n) if for every 
Boolean circuit C of size < s(n), Pr xg {o,i}« [C{x) = L(x)} < p(n). We extend this definition to 
general functions: we say that a function F : {0, 1}™ — > {0, l} m is p(n)-hard for size s(n) if for 
every m-output Boolean circuit C of size < s(n), Pr xe {o,i}« [C(x) = F(x)} < p{n). 



2.2 CSPs, PCPPs, and codes 

Fix an integer k > 1. A k-local Constraint Satisfaction Problem, or k-CSP, over finite alphabet S is 
a collection tp{x) = ipi(x), . . . ^ m (x) of Boolean-valued functions on the input x = (x\, . . . x n ) £ S n , 
where each ipj depends only on some k variables of x and is specified by a fc-tuple Ij C [n] and a 
truth-table on these k variables. Define Val^,(x), the value of tp on x, as the fraction of constraints 
ijjj that are satisfied by x (i.e. such that ipj(x) = 1). 

Next we define PCPPs. Fix a circuit C(x) on n Boolean input variables, a finite alphabet S, 
and a parameter (3 > 0. We say that a fe-CSP ip is a PCPP for C over £ with security (3 if: 

1. ip is defined on variable set (x,z), where x are the Boolean input variables to C and z are 
auxiliary 'proof variables taking values in S; 

2. For any x € {0, l} n , if C(x) = 1 then there exists a setting of z such that Val^p(x, z) = 1; 

3. For all x G {0, l} n and z, VaL^x,^) < 1 - ■ d(x ' C n . 

The proof size of ^ is the number of variables in z. 

The following positive result on PCPPs is due to Dinur. 



Theorem 6. \Din01 , Cor. 9.3] There is a constant-size alphabet Sq; a constant (3 > 0, and a 



polynomial-time algorithm that, given a circuit Q(x) of size t, produces a 2-CSP iPq(x,z) that is a 
PCPP for Q over Sq with security f3. Moreover, the proof size of ip is 0(poly(t)). 
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Following the techniques of earlier papers working with PCPPs, we will use PCPPs in conjunc- 
tion with efficient error-correcting codes. A (binary) code is an injective map E : {0, 1}^ -)• {0, 1}^' 
where N' > N. We also use E to denote the image of the map, i.e., we consider E C {0,1}^'. The 
minimum distance of the code is the minimum over distinct u, v £ E of d(u,v). An algorithm D 
decodes E from an r] fraction of errors if, given any string u at relative distance at most rj from 
some v! 6 E, A(u) outputs u' . Note that for such decoding to be possible, the minimum distance 
must be greater than 2r]N' . 

We will use the following well-known fact: 

Theorem 7. There is a polynomial-time computable code E for all input lengths N with output 
length N' = O(N), and an rj > 0, such that E can be polynomial-time decoded from an rj fraction 
of errors. 

Many such constructions are known; recently Goldwasser et al. [GGH + 08] gave a construction in 
which the decoder algorithm can be implemented in AC , i.e., with constant-depth, polynomial-size 
Boolean circuits. 

2.3 Promise problems and prAM 

A promise problem is a pair II = (Hyes^no) of disjoint subsets of {0,1}* (the 'yes' and 'no' 
instances, respectively). For a function s(n) € [0, 1], we say that (Hyes^no) £ P r ^^i,s(n) if there 
exists a polynomial-time randomized algorithm M(x,r,w), with |r|, \w\ = 0(poly(n)) such that: 

1. (Completeness) If x S ITy^s, then with probability 1 over the random string r, there exists 
a w = w(r) such that M(x,r,w) = 1; 

2. (Soundness) If x € IIno and |x| = n, then the probability over the random string r that there 
exists a w such that M(x,r,w) = 1, is at most s(n). 

The algorithm M defines an 'Arthur-Merlin protocol': we consider that a polynomially bounded 
verifier Arthur chooses a random 'challenge' r for the computationally unbounded Merlin, who sees 
r and gives a response w which Arthur accepts or rejects.. 

We define prAM = prAM lfl / 3 . A promise problem 111 = (Hyes^no) is prAM-hard if for 
all IT' = (U YES ,H' NO ) in prAM, there exists a polynomial-time computable reduction R(x), such 
that, if x G TfyESi then R(x) £ Uyes, while if x G II^q, then R(x) £ ITvo. We say that II is 
pr AM -complete if II is in prAM and is prAM-hard. 

It is not hard to see that for any II G prAM, the soundness parameter 1/3 in the protocol can be 
made exponentially small in n, by parallel repetition of the original protocol. However, we require 
soundness-amplification that is more efficient in its use of randomness. This is provided by a result 
of Bellare et al. [BGG93J. They state their theorem for AM, not for prAM, but the proof carries 
over without changes to the promise setting and we state it for this setting. 

Theorem 8. [BGG93] Let W = (Uyes^no) £ prAM ; where M(x,r,w) is a polynomial-time 
predicate defining an Arthur-Merlin protocol for IT. Let n = \x\, and fix a polynomial m(n). Then 
there exists an Arthur-Merlin protocol for U defined by a polynomial-time predicate M'(x,r' ,w'), 
with \w'\ < 0(poly(n)), \r\ < \r'\ < 0(\r\ +m(n)), and with soundness 2~ m ( n \ 

The randomness-efficiency in the above result has been improved in more recent work (see [MU02| 
for a discussion), but we do not need or use these improvements. 
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2.4 AM-A>CSPs 



By an AM-k-CSP we mean a fc-CSP ip(r, z), where r are Boolean and z may be non-Boolean. We 
call r the 'Arthur- variables' and z the 'Merlin- variables'. Informally speaking, we are interested in 
the game in which the r are first set uniformly by Arthur, and then Merlin sets z to try to maximize 
the fraction of constraints of tp satisfied by (r, z) . 

For any fixed k > 1, soundness parameter s = s(\r\) G [0,1], alphabet E, and fixed e G (0,1], 
we define the promise problem Gap - AM - k - CSP 11 _ es( | r | ) = {Uam-CSP,yeS,^AM-CSP,no) as 
follows. Both 'yes' and 'no' instances are AM-A:-CSPs over E. If ip(r,z) £ ^-AM-CSP,YESi we are 
promised that for all choices of r, there exists a z such that Val^(r, z) = 1. If ip(r, z) G ^-am-CSP,NOi 
we are promised that only for at most an s(|r|) fraction of strings r does there exist a z with 
Val^,(r, z) > 1 — e. 

3 An Augmented PCPP 

As a tool for proving Theorems [H [U and \5\ we prove the following 'augmented' version of the 
PCPP Theorem (Theorem [6]), which we derive from Theorem [6l We remark that the proof of our 
prAM-completeness result (Theorem [T]) uses only condition ([9li) of the Lemma below; this first part 
of the Lemma is quite similar to previous uses of PCPPs. Also, our use of error-correcting codes 
will only be important for establishing condition dUii). 

Lemma 9. There is a finite alphabet Eo such that the following holds. For any e > there is 
a v > and a polynomial-time algorithm A that takes as input a Boolean circuit C = C(r,w). 
A outputs a 2-CSP ip(r,z), where \z\ = 0(poly(|C|)) and the variables of z are over So- Letting 
£ = |r|, tp has the following properties: 

For all r, if there is a w such that C(r,w) = 1, then there is a z such that VaWr, z) = 1. 
On the other hand, if r is al-far from any r' for which C(r', •) is satisfiable, then for all z, 
Val^(r, z) < 1 — fi(a). 

^ii) Suppose P{r) is any (possibly randomized) procedure such that with probability at least p = p(l) 
over a uniform r G {0, l} e and any randomness in P , P{r) outputs a z such that Val^,(r, z) > 
1 - v. 

Then there exists a deterministic procedure P(r), such that with probability at least p(£) -2~ ei 
over uniform r, P(r) outputs a w such that C(r,w) = 1. Moreover, P{r) is computable by 
a nonuniform, poly (|C|) -sized circuit that makes a single oracle call to P on the same input 
length (with P's randomness fixed nonuniformly). 

Proof. Let := \w\. We can assume, by padding w if necessary, that N > i. Let E : {0, 1}^ — > 
{0, 1}^' be the error-correcting code given by Theorem [7] (with N' = 0(A)), applied to inputs of 
length N. 

Let b = b(£) := [^-~|. Define a predicate Q(n,r2, . . . , r&,«), with \ri\ = \r\ = i for i < b and 
\u\ = N', by the following rule: Q(r±,r2, ■ ■ ■ r^, u) = 1 iff r± = T2 = ■ ■ ■ , = rj,, u = E{w) for some w, 
and C(n,w) = 1. Clearly we can efficiently construct a circuit of size 0(poly(|C|)) computing Q. 
Note that by our setting of b, there are more variables in the blocks rj than in u. 

Let ipo = iPq((vi, . . . , Tb, u), Z) be the PCPP 2-CSP for Q over alphabet So given by Theorem El 
efficiently constructible and of size 0(poly(|C|)). We take ipo and make two changes. First, we 
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substitute the variables of r for the corresponding variables of each vector r,. Second, we allow the 
variables of u to range over all of So (we may assume {0, 1} C So), and modify each constraint to 
reject in the case where one or more of its u-variables are set to a non-Boolean value. 

We denote the resulting 2-CSP by ip(r, u, Z). We claim that this efficiently constructible 2-CSP 
satisfies the conditions of Lemma [9fs statement, with z := (u, Z) and So as in Theorem [6l Our 
setting of v > will be determined later. 

First, we show that condition ([9li) is satisfied. Consider any r € {0,1}^. Suppose that there 
exists w € {0,1}^ such that C(r,w) = 1. Then Q(r,r,... ,r,E(w)) = 1. Using the completeness 
property of PCPPs, there exists a Z such that Val^,((r, E(w)), Z) = 1. On the other hand, say 
r is a£-far from any r' for which C(r', •) is satisfiable. Given any u £ S^ , let us choose some 
string v! € {0, 1}^ which agrees with u on any variable where u is Boolean. We observe that 
(r, r, . . . ,r,u') is a/2-far in relative distance from By the soundness property of PCPPs, 

for any choice of Z, VaL/,((r, u'), Z) < 1 — where (3 > is the constant from Theorem Also, 
by the way we defined tp, Val^,((r, u), Z) < Val^((r, u'), Z). We have verified condition (J9ji). 

Now we turn to condition (J9jii). Let P(r) be as described in ([Hii). Note that by averaging, 
we may fix (nonuniformly) some value of the randomness used by P while preserving the lower- 
bound p(£) on its success probability over the choice of r; we do so and consider P a deterministic 
algorithm from now on. We set v := 77/37/4, where r/ is the constant in Theorem [71 /3 is the constant 
in Theorem [6l and 7 6 (0, 1) is a small constant to be announced. 

Let P'(r) be the procedure that, on input r, computes z = P(r) = (u, Z) and runs the 
polynomial-time decoder for E on u, yielding a string w E {0, 1}^. Let P' output w. 

We analyze the behavior of P'. Let z = (u, Z) be any output of P{r) such that Val^,(r, z) > 1 — u. 
By the soundness property of PCPPs, the string (r, r, . . . , r, u) must be ^ = ^-close in relative 
distance to some string (n, . . . , r^, v!) for which Q(ri, . . . , r^, u') = 1 (and thus r% = ...=■ and 
v! € E). Since = N' < b(£) ■ \r\ < 2N', we find that d(r,ri) < ^£ and d(u,u') < rjN'. The latter 
inequality implies that when P' applies the polynomial-time decoder to u, it correctly recovers 
w = E~ l {u'). Since Q(r±, . . . , r&, u') = 1, we have C{r\,w) = 1. 

To analyze P, say that a string r € {0, l} n is good if P'(r) outputs a w such that there exists 
an r' at distance at most "fl from r, such that C(r',w) = 1. Our analysis of P', combined with 
our original assumption about the success probability of P, shows that at least a p{t) fraction of 
strings r are good. 

Now we define the procedure P(r): P(r) first chooses a vector v G {0, 1}' uniformly from the 
set of all strings of Hamming weight at most 7?, then outputs P'(r + v). Note that, if r is selected 
uniformly, r + v is also uniform and, after conditioning on its value, r is uniformly distributed over 
all strings at distance at most *y£ from r + v. Thus, conditioning on r + v being good, we have 
at least a l/V^i > 2~ H ^ i chance that C(r,P'(r + v)) = 1. So the overall success probability of 
P(r) is at least p(£) ■ 2^ H ^ i . Since #(7) — > as 7 — > 0, we may choose 7 > so that the success 
probability is at least p(£) ■ 2~ ei . 

P' is clearly a polynomial-time algorithm making one call to P, while P simply makes one call 
to P' after its random sampling and bitwise addition mod 2. The choice of v can be nonuniformly 
fixed in a way that does not decrease the success probability, so P can be implemented with the 
resources claimed. Thus we have verified condition ((Uii), completing the proof of the Lemma. □ 
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4 PCP Characterization of prAM 



In this section we prove Theorem [H which we restate in the terminology of Section 12.41 

Theorem Q] (restated). There is a finite alphabet S and a constant e > 0, such that 
Gap — AM — 2 — CSP;l ]_ e exp /_ n(\r\)} is pr AM -complete. 

Proof. First, we claim that for any s(\r\) = o(l) and e > 0, Gap — AM — 2 — CSPj_ 1—e s /| r ij = 
{Ham—CSP,yeSj H-AM-CSPfNo) is i n prAM. The protocol is as follows: given a 2-CSP ip(r, z), Arthur 
picks r uniformly and Merlin responds with a setting of z. Arthur accepts iff Val^(r, z) = 1. If 
ij) G ^am~csp,yes, then clearly Arthur accepts with probability 1 when Merlin responds optimally. 
If ip G nAM-C^JVO) then Arthur accepts with probability at most s(|r|), which is greater than 
2/3 for large enough \r\. (For instances with |r| below this threshold, Arthur can simply request 
certificates z(r) for every setting of r and verify that each satisfies Val^,(r, z(r)) = 1.) 

Thus our main task is to show that the promise problem is prAM-hard, for appropriate choice of 
parameters. Let II = (Hyes, IIjvo) € prAM, and let M±(x, ri,w±) be a polynomial-time-computable 
predicate defining an Arthur-Merlin protocol for EL We use parameters n = \x\,£i(n) = |r|; by 
definition of prAM we have £\{n) = 0(poly(n)) and \w\\ = 0(poly(n)). By padding r\ if necessary 
we may assume £i(n) > n. Apply Theorem[8]to M%, with the setting m(n) := £\{n). Thus we get 
a new Arthur-Merlin protocol M2{x,r2,W2) for IE, with \r%\ = £-2(n) G [n, . . . ,D ■ £i(n)} (for some 
fixed D > 0), \u)2\ = 0(poly(n)), and with soundness 2~ £l ^ n \ 

Given an input x G H-yes U EIatO) w e construct a poly(n)-sized circuit C(r2,W2) = C x (r2,W2) 
that accepts iff M2(x,r2,W2) = 1. To this circuit we apply the algorithm A of Lemma [9] (with a 
setting of e > to be announced), yielding a 2-CSP ?/> = ip(r2,z) which we make the output of our 
reduction. 

We show the correctness of the reduction. First, suppose that x G Hyes- Then for each choice 
of T2, there exists a W2 such that M2(x,r2,W2) = 1. By condition ([9li) of Lemma EJ there exists z 
such that Val^(r 2 ,z) = 1. Thus -0 G EI^Af -csp, Y es- 

Now suppose that x G IIjvo- Then by the soundness property of M2, the number of strings r 2 
for which M 2 {x,r 2 , •) is satisfiable is at most 2~ 1 ^ • 2 l ^ n) < 2 (1 "i^ 2(n) . Thus the number of r 2 
for which there exists an r' at distance < a^ 2 ( n ) from r 2 , such that M 2 (x,r', •) is satisfiable, is at 
most 

Choosing a > such that -ff(a) < ^, we find that with probability > 1 — exp{— il(^ 2 (n))} over a 
uniform choice of r 2 , r 2 is a£ 2 (n)-far from any r' such that C(r', •) is satisfiable. For such r 2 and 
for any z, condition (|9j i) of Lemma [9] tells us that Val^,(r 2 , z) < 1 — 0(a). 

Thus if we fix e > as an appropriately small constant and choose an appropriate s(|r 2 |) = 
exp{— J7(|r 2 |)}, we have ip G Ham-csp,no- This completes the proof of correctness for our reduc- 
tion. □ 

5 Randomized Optimization Hypotheses Imply Collapse of AM 

What significance might Theorem [H our 'PCP characterization of AM', have for the project of 
trying to prove new upper bounds on the power of this class? In the Introduction we gave two 
hypotheses inspired by TheoremQ] Each of these hypotheses, if true, would have major implications 
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for the study of AM; this is the content of Claims [2] and [3] from the Introduction, which we prove 
next. 

Proof of Claim\^ Let L G AM; then (L, L) € prAM. Given an instance x, let Arthur run the reduc- 
tion in Theorem[T]on input x, producing a 2-CSP ip(r, z). Let Merlin send Arthur a polynomial-sized 
circuit C : {0, l} e — > {0, l} m , with 5 := e (here e is from Theorem [T]). Then Arthur runs C on a 
sufficiently large (0(poly(n))) number of random choices of r, accepting only if he finds an r such 
that Val^(r,C(r)) > 1 - e. 

First suppose x € L; then by the completeness property of our reduction, for all r there exists 
a z for which VaL/,(r, z) = 1. If Merlin sends the circuit assumed to exist by Hypothesis A, 
then with at least 1/ poly (|€|) probability over r, Val^,(r, C(r)) > 1 — e. So if Arthur samples a 
sufficiently large (polynomial) number of strings r, Arthur will accept with probability > 2/3. 

Next suppose x ^ L; then our reduction guarantees that for all but an exponentially small 
fraction of strings r, for all z Val^(r, z) < 1 — e. So Arthur's acceptance probability is negligible no 
matter what circuit Merlin sends. Thus we have an MA protocol for L. □ 

Proof of Claim\^ We apply Hypothesis B with 5 := e/3, yielding a value t = t(5). Let L £ AM be 
given, and let Arthur run the reduction from Theorem [1] on input x, yielding an instance ip(r,z). 
Let Merlin send a description of a function F(r), where each output of F depends on at most t bits 
of r (note F can be described in polynomial size). Arthur performs explicit variable-substitutions 
z = F(r) in ip and uses linearity of expectation to exactly compute E r [Val^(r, F(r))]. 

If x E L and Merlin sends F^p as given by Hypothesis B, this expectation is at least (1 — 5) 2 > 
1 — 2e/3. On the other hand, if x ^ L then, regardless of the function sent, this expectation is 
at most (1 — e) + exp{— f2(|r|)}. Thus for |r| large enough we can distinguish the two cases. (If 
|r| is below a fixed threshold, Arthur can instead request that Merlin send optimal values z(r) for 
each r.) Arthur's computations are deterministic and polynomial-time, so the above defines an NP 
protocol for L. □ 

Note that Claim [3] would hold even if we weakened Hypothesis B, allowing each coordinate 
of F(r) to depend on t(5,n) = O^logn) coordinates. We state Hypothesis B in a stronger form 
because, although we believe it is false, we don't know how to disprove it unconditionally even in 
the form given. 

6 Evidence Against the Randomized Optimization Hypotheses 

Next we use LemmaEl in conjunction with known results about amplification of hardness, to prove 
Theorems S] and That is, under various complexity-theoretic assumptions, we exhibit families 
of 2-CSPs ip{r, z) for which it is hard on average to approximately optimize over z, for randomly 
chosen r. As mentioned earlier, the conclusions of both Theorems are easily seen to falsify both 
of our Randomized Optimization Hypotheses, and we consider this evidence that these hypotheses 
are probably false. 

First, amplification of hardness in NP n coNP from (1 — 1/ poly(n))-hardness to 2/3-hardness is 
made possible by the following result of Impagliazzo |Imp95, essentially Thm. 2]: 

Theorem 10. [Imp95] Suppose that there exists a language L, a function s(n), and a c > 0, such 
that L is (1 — ^s)-hard for size s(n). Then for any d > 0, there exists another language L' such 
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that V is (i + 0(-±r))-hard for size oil) • Moreover, JJ is polynomial-time truth-table reducible to 
L. 

Lemma 11. Suppose that there exists a language L G NP n coNP and 7, c > such that L is 
(1 — ^c)-hard for size 2 in . Then there exists another language L' G NP n coNP and a 7' > such 
that L' is 2/3-hard for size 2 7 ' n (for sufficiently large n). 

Proof. Apply Theorem 1 101 with s{n) := 2" /n and with any d > and 7' G (0,7), and use the fact 
that NP n coNP is closed under polynomial-time reducibilities, i.e., p NPncoNP = |\|P n coNP. □ 

Next, the Impagliazzo-Wigderson pseudorandom generator |IW97j allows us to amplify 'mod- 
erate' hardness of the type produced by Lemma Qj] into 'extreme' hardness, albeit of a function 
problem rather than a decision problem (in [IW97J additional techniques are used to produce ex- 
tremely hard decision problems, but we do not follow this path). The next definition follows [IW97] 
(and previous works). Given a language L, an integer c > 1, a parameter k = k(n), and a function 
G(r) : {0, l} cn {0, l} kxn (called a 'generator' function), define L k o G : {0, l} cn -> {0, l} k by 

(L k o G)(r) := (L(Gi(r)), L(G 2 (r)), . . . , L(G k (r))) , 

where the string G(r) is divided into k blocks G\(r), . . . , G^(r), each of length n. The basic idea is 
that if G(r) is appropriately 'pseudorandom', then the collection G%(r), . . . ,Gk(r) should behave 
in important respects like a truly independent collection of random strings. In particular, if it is 
somewhat hard to compute L(x) for a random x, it should be very hard to compute (L k o G)(r) 
correctly when k is large. 

The following result (a restatement of |IW97l Thm. 2.12]) gives the main hardness-amplification 
property of the generator defined in that paper, which we denote Gjw- 

Theorem 12. \IW9T$ For any 7 > ; there are 7', c > 0, and a polynomial-time computable 
G m : {0, l} cn -> {0, i} n * n , such that: if L is 2/3-hard for size 2^ n , then (L n o G IW ){r) is 
2~7 ™./j arc ; f or s i ze 2 7 n . 

(Recall our definition of average-case hardness for general functions from Section [2.11 ) 

Proof of Theorem^ We begin by applying Lemma [TT1 to our language L G NP n coNP, yielding a 
language V G NP n coNP that is 2/3-hard for circuits of size 2 7 ° n for some 70 > 0. Then we apply 
Theorem [12] to L'; we derive a 7' > 0, such that {{L') n o Giw)(f) is 2~ 7 "-hard for size 2 7 n . 

Since U G NP n coNP, there exists a polynomial-time witness predicate M(x,w), producing 
outputs from {0,1,?}, satisfying: 

1. for all (x,w),M(x,w) G {L'(x),?}; 

2. for all x, there exists a w such that M(x,w) = L'(x); 

3. \w\ = 0(poly(n)). 

Let t(n) = \w\. We reformat M if necessary to ensure that the first bit of w consists of a 'claim' 
bit, call it w c i, such that for any (x,w) with M(x, w) = L r (x), we have w c i = L'(x). Next we 
define M'(x,w), which outputs 1 if M(x,w) G {0,1}, otherwise. M' is also polynomial-time 
computable. 
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Define a predicate Q(r, w\, . . . , w n ) : {0, l} m x{0, l} nx *( n ) — >. {0, 1} as follows: Q(r, w\, . . . , w n ) = 
1 iff for all i £ [n], M' {Giw,i{r),Wi) = 1. Q is polynomial-time computable since Giw and M' 
are, so let Q n be a 0(poly(n))-sized circuit for Q on input parameter n. Clearly Q n is efficiently 
constructible. 

We claim that Q defines a hard-on-average search problem. To see this, suppose C(r) : 
{0, 1} C " —7- {0, i} nx *( n ) i s any circuit of size at most 2 7 n which has some p(n) probability over 
r of outputting a collection wi,...,w n for which Q(r, w\, . . . , w n ) = 1. Then we may construct 
a circuit C(r){0, l} cn — > {0, l} n that simply restricts the output of C(r) to the 'claim' bits of 
the strings wx,...,w n that C produces. Observe that C'(r) has a p(n) chance (over r) of cor- 
rectly outputting ((L') n o Giw){f)- Moreover, C'(r) also has size bounded by 2 yn . We conclude 
p(n) < 2~T' n . 

We invoke Lemma [9] with e := r y'/(2c), yielding a poly-time algorithm A (and an associated 
v > 0). We apply this A to Q n , yielding a 2-CSP ip n (r,z) (here \z\ = d(n) = 0(poly(n))). We 
claim that the 2-CSP family {ip n (r, z)} n> o satisfies the conditions of Theorem HI 

To see this, first note that for all r, M'(r, •) is satisfiable; so, there exists wi, . . . ,w n such that 
Q n (r, w\ , . . . , w n ) = 1. Thus by condition ([9ji) of Lemma[U there exists z such that Val^, n (r, z) = 1. 
So condition (jUi) is satisfied. 

To establish condition glii), let 72 := e and (9 := v. Suppose C(r') : {0, l} cn {0, 
is a circuit of size at most 2 72?l , such that with some probability q(n), Val^„(r', C(r')) > 1 — 0. 
By condition ([Hii) of Lemma [U there exists a circuit C(r) : {0, l} cn — > {0, l} nx *( n ) ; such that 
with probability at least q(n) ■ l~ e<yCn ^ over r, Q n (r,C(r)) = 1. Moreover, (7 is of size at most 
|C| + 0(poly(n)), which for large enough n is less than 2 7n . By our previous analysis we find that 
q{n) ■ 2" £ ( cn ) < 2-t'", i.e., q(n) < 2^ c ~"<> = 2~l' n / 2 = exp{-Sl(n)}. We have proved condition 
(JUii), for our settings of 72, □ 

Next we turn to prove Theorem For this, we need a more detailed analysis of the generator 
Giw- Besides the hardness-amplification property summarized in Theorem 112^ Giw has another 
useful property: with very high probability over r, the fraction of the strings Giw,i( r ), ■ ■ ■ , Giw,n(r) 
which are in L is close to \L n {0, l} ?l |/2 ri , that is, close to the fraction we'd expect if these strings 
were drawn independently and uniformly. To prove this fact (not proved or used in |IW97| ) . we 
first describe the generator in more detail. 

The input r to Giw consists of two parts, r = (r a ,r^). G/v^(r a ,rft) is defined blockwise for 
i G [n] as 

Giw,i{r a ,n) = Ki(r a ) + i^'(r 6 ), 

with Ki : {0, l}l r ' a l {0, l} n , K[ : {0, l}l r '"l -> {0, l} n , and with + denoting bitwise addition mod 
2. The definition of K[ is not important to us; let us describe the functions Ki. The string r a 
defines a random walk of length n (counting the starting vertex) on an explicit expander graph Q n 
with vertex set {0, l} n . Q n is 16-regular with normalized second eigenvalue A n at most some fixed 
A < 1. Vi = Ki(r a ) represents the i-th vertex visited in this walk. v\ is a uniform element, and 
each subsequent step Vj+i is a uniform choice from among the neighbors of v^. (Note that this can 
be achieved with \r a \ = O(n) random bits as claimed.) 

We will use the following powerful result, called the Strong Chernoff Bound for Expander Walks, 
proved by Healy [Hca08j . 

Theorem 13. IHeaOS^ Let G = (V, E) be a d-regular graph with second eigenvalue \, let m > 0, 
and let fi,...,f m : V — > [0,1] have expectations /ii,...,/i m (over a uniform choice of v E V). 
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Taking a random walk v\, . . . , v m on G with uniform starting-point, we have for all e > 0, 



Pr 



i<m 



> em 



< 2e~ 



e 2 (l-X)m 



A more general claim was made earlier by Wigderson and Xiao [WX05J , but the proof contained 
an error, as pointed out in [WX08] . (A valid proof of the Theorem above, with different constants, 
can still be extracted from |WX05j .) 

For any r G {0, l} cn , let (J(r) := \{i £ [n] : L(Giw,i(r)) = 1}|. Theorem [131 implies the following 
concentration bound for the generator Giw- 

Lemma 14. Let L be an arbitrary language. Let c n = \L Pi {0, l} n |/2 n . Then for any fixed 5 > 0, 

PrIWr) — c n ■ n\ > Sn] < exp{— 0(n)}. 

r 

Proof. Recall that r = (r a , r&). We show that the above inequality is true after conditioning on any 
value of r&; this will prove the Lemma. Let (v[, . . . ,v' n ) = (K[(rb), . . . ,K' n (ri,)). Then for i G [n], 
Giw,i( r ) G L iff Ki(r a ) + v\ 6 L, or equivalently Ki(r a ) £ L n + v[ (where L n := L n {0, 1} and 
Ln + v[ = {x + Vi : x G Ln.}). 

Define /j : {0, l} n — > {0, 1} to be the characteristic function of L n + v[. Clearly m = c n for all 
i. The result now follows by a direct application of Theorem I13| using the fact that Q n has second 
eigenvalue bounded away from 1. □ 

Proof of Theorem\^ Our choice of £o, determined later, will be no larger than 1/6, so by Theo- 
rem [12j our hypothesis implies there is a 7' > 0, a c > 0, and a polynomial-time Giw '■ {0, l} cn — > 
{0, l} nxn , such that: for any circuit C : {0, l} n ^ {0, l} n of size at most 2^' n , 



P4C(r) = (L n oG IW )(r)]<T 



Let M(x,w) be a polynomial-time verifier for L: x € L iff there exists w such that M(x,w) = 1. 
Let t(n) = \w\ = 0(poly(n)). 

Let jj(r") be as defined after Theorem [TBI If C : {0, l} cn — > {0, l}*( n ) xn i s a circuit producing n 
strings w\, . . . ,w n , each of length t(n), define 

tto(r) := |{» S N : M(G Wji (r), ^) = 1}|. 

Claim 15. There exists 7" > such that the following holds. If C(r) : {0, l} cn -4 {0, l}*(™) xn i§ a 
circuit of size at most 2 7 n , iften 



Pr[(t(r) < MO +7"™] < 2" 



■7 n 



Proof (of ClaimUjfy. Let a > 0. Say C(r) : {0, l} cn -> {0, 1}*(™) X " is a circuit of size at most 
2 an , such that Pr r [JJ(r) < Hc'(r) + an] > 2~ an . Consider the following randomized procedure that 
attempts to compute (L n o G/p^)(r): 

• Let C{r) = (wx,...,w n ) G {0,l}*( n ) xn . Let I C [n] be the indices i for which M(G IW>i {r),Wi) = 
1. Pick a random subset J of [n], uniformly from the set of all subsets of size less than an 
(including the empty set). Output the characteristic vector of I U J. 
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We analyze this procedure. Suppose r is any input for which jj(r) < ttc( r ) + an - Note that by 
definition of M, we always have I Q {i & [n] : L(Giw,i( r )) = !}• Then there exists a J C [n] \ 5, 
of size less than an, such that / U J = {i G [n] : L(G\zw,i(r)) = 1}. Thus conditioned on this event, 
the procedure succeeds with probability at least v 1 > 2~ H( - a ^ n . So the overall success probability 

is at least 2~ an ■ 2~ H ^ n = 2~ { - a+H ^ n . 

Let us nonuniformly fix a setting J that maximizes the procedure's success probability, and use 
this choice to run the procedure. The result is a (nonuniform) circuit of size 2 an + 0(poly(n)), 
with success probability > 2~( a+H ( a ^ n . For a sufficiently small this contradicts the hardness of 
(L n o Giw), proving the claim. □ 

Now we set e := min(l/6,7"/4). Fix any circuit C : {0, l} cn — > {0, l}*( n ) xn of size at most 
2 7 n . We use Lemma [T41 applied to 5 := eq, and the previous Claim, to find that, with probability 
> 1 — exp{— fi(n)} over r, we have the simultaneous inequalities (c n + £o)n > (J(r) > (c n — £o)n and 
tt( r ) > ttc( r ) + j"n. Call a string r with this property C-typical. 

What is c n ? We claim it must lie in [1/2 — eq, 1/2 + sq\. For otherwise, a size-1 circuit could 
guess L(x) with probability greater than 1/2 + Eo by guessing the majority value on length n, 
contrary to our hardness assumption about L. Thus for a C-typical r, ttc( r ) < (l/2 + £o) n — l" n < 
(1/2 - 37"/4)n and also (J(r) > (1/2 - e )ra - > (1/2 - 7"/2)n. 

Defining n as some rational number in the interval (1/2 — 37"/4, 1/2 — j" /2), define a predicate 
Q(r,u;i,...,u; n ) : {0, l} cn x {0, 1}™ X *(") -> {0,1} as follows: Q{r,w u . . . ,w n ) = 1 iff for at least 
an r\ fraction of indices i we have M(Giw,i( r ), w%) = 1- Q is itself polynomial-time computable, 
computed by some uniform family {Q n }n>o of poly-size circuits. We have the key property that 
for a C-typical r, there exist Wi, . . . ,w n such that Q(r, w\, . . . , w n ) = 1, yet Q(r, C(r)) = 0. 

Invoke Lemma [9] with e := j"/(2c), yielding an algorithm A (and an associated v > 0). Then 
we claim {A(Q n )} n> o = {ip n (r, z)} n> o is the desired family of 2-CSPs (here |r| = cn, \z\ = d{n) = 
0(poly(n))). First we verify condition (|5ji). Consider any r for which there exists a w±, . . . ,w n 
such that Q n ( r ,wi, . . . ,w n ) = 1. By condition ([9li) of Lemma El we find that in this case there 
exists z such that Val^ n (r, z) = 1. Since all but an exp{— J7(n)} fraction of r have this property, 
condition ([5ji) is satisfied. 

To establish condition (|5lii), fix 72 as any value in (0,7") and let 6 := v. Suppose C(r) : 
{0,l} m {o,i} d ( n ) is a circuit of size at most 2 72?1 , such that with some probability q(n), 
Val^ (r, C(r)) > 1 — 9. By condition (J9jii) of Lemma [U there exists a circuit C(r) : {0, l} cri — > 
{0, l} nx ' (n) , such that with probability at least q(n) ■ 2~ £ ( cn ) over r, Q n (r,C(r)) = 1. Note that 
such an r fails to be C-typical. Moreover, C is of size at most |C| + 0(poly(n)), which for large 
enough n is less than 2 7 11 . 

So, by our previous analysis we find that q{n) - 2~ £cn < 2 -7 n , i.e., q(n) < 2( £C ~ 7 "> n = 2~ 7 n / 2 = 
exp{— Q(n)}. We have proved condition (|5jii). This completes the proof of Theorem □ 

Finally, we note that versions of Theorems H] and [5] can be proved, in which both the hypotheses 
and conclusions apply, not to general circuits, but to the class of TC° circuits (i.e., constant-depth 
Boolean circuits with majority gates), or any circuit class containing TC°. This is because all 
the reductions involved can be carried out in TC°. (For a discussion of why the Impagliazzo and 
Impagliazzo-Wigderson constructions amplify hardness in TC°, see Agrawal |Agr01| ; the difficulties 
in amplifying hardness in lower classes like AC were explored by Shaltiel and Viola [SV08J.) 
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7 Questions for Further Research 



Does our approximation problem remain prAM -complete if each variable in the CSP ip(r, z) is 
restricted to appear in only a constant number of constraints? (The 'expander-replacement' 
technique [PY914 Pap94| allows us to restrict the occurrences of z-variables in our prAM- 



completeness proof; it is the 'stochastic' r- variables which pose a challenge.) Alternatively, 
can one perhaps show that under this restriction the problem lies in NP? 

• Can we unconditionally disprove Hypothesis B? Given the sharp limitations of NC° circuits 
this might be possible. 

• Can PCP ideas be used to give new upper bounds on the class AM? 

• Find more applications of PCPPs in complexity theory. 
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